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Abstract 

The minimum-cost flow problem is a classic problem in combinatorial optimization 
with various applications. Several pseudo-polynomial, polynomial, and strongly poly¬ 
nomial algorithms have been developed in the past decades, and it seems that both the 
problem and the algorithms are well understood. However, some of the algorithms’ 
running times observed in empirical studies contrast the running times obtained by 
worst-case analysis not only in the order of magnitude but also in the ranking when 
compared to each other. For example, the Successive Shortest Path (SSP) algorithm, 
which has an exponential worst-case running time, seems to outperform the strongly 
polynomial Minimum-Mean Cycle Canceling algorithm. 

To explain this discrepancy, we study the SSP algorithm in the framework of 
smoothed analysis and establish a bound of 0{mn(j)) for the number of iterations, 
which implies a smoothed running time of 0{mn4>{m+n logn)), where n and m denote 
the number of nodes and edges, respectively, and 0 is a measure for the amount of 
random noise. This shows that worst-case instances for the SSP algorithm are not 
robust and unlikely to be encountered in practice. Furthermore, we prove a smoothed 
lower bound of H(m • min {n, </>} • cf>) for the number of iterations of the SSP algorithm, 
showing that the upper bound cannot be improved for <f> = fi(n). 


1 Introduction 

Flow problems have gained a lot of attention in the second half of the twentieth cen¬ 
tury to model, for example, transportation and communication networks CHS]. Plenty 
of algorithms have been developed over the last fifty years. The first pseudo-polynomial 
algorithm for the minimum-cost flow problem was the Out-of-Kilter algorithm indepen¬ 
dently proposed by Minty m and by Fulkerson CD]- The simplest pseudo-polynomial 
algorithms are the primal Cycle Canceling algorithm by Klein m and the dual Succes¬ 
sive Shortest Path (SSP) algorithm by Jewell P~3], Iri P2j, and Busacker and Gowen [5]- 
By introducing a scaling technique Edmonds and Karp [5] modified the SSP algorithm to 
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obtain the Capacity Scaling algorithm, which was the first polynomial time algorithm for 
the minimum-cost flow problem. 

The first strongly polynomial algorithms were given by Tardos |25| and by Orlin |20j. 
Later, Goldberg and Tarjan hd proposed a pivot rule for the Cycle Canceling algorithm to 
obtain the strongly polynomial Minimum-Mean Cycle Canceling (MMCC) algorithm. The 
fastest known strongly polynomial algorithm up to now is the Enhanced Capacity Scaling 
algorithm due to Orlin m and has a running time of 0(mlog(n)(m + nlogn)), where n 
and m denote the number of nodes and edges, respectively. For an extensive overview 
of minimum-cost flow algorithms we suggest the paper of Goldberg and Tarjan [12], the 
paper of Vygen [27] , and the book of Ahuja, Magnanti, and Orlin jTj. 

Zadeh [25] showed that the SSP algorithm has an exponential worst-case running time. 
Contrary to this, the worst-case running times of the Capacity Scaling algorithm and the 
MMCC algorithm are 0(m(\og U) (m + nlogn)) |8l and 0(m 2 n 2 min{log(nC'), m}) [22], 
respectively. Here, U denotes the maximum edge capacity and C denotes the maximum 
edge cost. In particular, the former is polynomial whereas the latter is even strongly 
polynomial. However, the notions of pseudo-polynomial, polynomial, and strongly poly¬ 
nomial algorithms always refer to worst-case running times, which do not always resemble 
the algorithms’ behavior on real-life instances. Algorithms with large worst-case running 
times do not inevitably perform poorly in practice. An experimental study of Kovacs m 
indeed observes running time behaviors significantly deviating from what the worst-case 
running times indicate. The MMCC algorithm is completely outperformed by the SSP 
algorithm. The Capacity Scaling algorithm is the fastest of these three algorithms, but 
its running time seems to be in the same order of magnitude as the running time of the 
SSP algorithm. In this article, we explain why the SSP algorithm comes off so well by 
applying the framework of smoothed analysis. 

Smoothed analysis was introduced by Spielman and Teng [23] to explain why the 
simplex method is efficient in practice despite its exponential worst-case running time. 
In the original model, an adversary chooses an arbitrary instance which is subsequently 
slightly perturbed at random. In this way, pathological instances no longer dominate 
the analysis. Good smoothed bounds usually indicate good behavior in practice because 
in practice inputs are often subject to a small amount of random noise. For instance, 
this random noise can stem from measurement errors, numerical imprecision, or rounding 
errors. It can also model influences that cannot be quantified exactly but for which there 
is no reason to believe that they are adversarial. Since its invention, smoothed analysis has 
been successfully applied in a variety of contexts. Two recent surveys US] EH summarize 
some of these results. 

We follow a more general model of smoothed analysis due to Beier and Vocking [2]. 
In this model, the adversary is even allowed to specify the probability distribution of the 
random noise. The power of the adversary is only limited by the smoothing parameter cj). 
In particular, in our input model the adversary does not fix the edge costs c e G [0,1], but 
he specifies for each edge e a probability density function f e : [0,1] —> [0, (j>] according to 
which the costs c e are randomly drawn independently of the other edge costs. If cj) = 1, 
then the adversary has no choice but to specify a uniform distribution on the interval [0,1] 
for each edge cost. In this case, our analysis becomes an average-case analysis. On the 
other hand, if cj) becomes large, then the analysis approaches a worst-case analysis since 
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the adversary can specify a small interval I e of length 1/0 (which contains the worst-case 
costs) for each edge e from which the costs c e are drawn uniformly. 

As in the worst-case analysis, the network graph, the edge capacities, and the balance 
values of the nodes are chosen adversarially. The edge capacities and the balance values 
of the nodes are even allowed to be real values. We define the smoothed running time of 
an algorithm as the worst expected running time the adversary can achieve and we prove 
the following theorem. 


Theorem 1. The SSP algorithm requires 0(mnf>) augmentation steps in expectation and 
its smoothed running time is 0{mn(j){m + n log n)). 


If 0 is a constant - which seems to be a reasonable assumption if it models, for example, 
measurement errors - then the smoothed bound simplifies to 0{mn{m + n log n)). Hence, 
it is unlikely to encounter instances on which the SSP algorithm requires an exponential 
amount of time. 

The following theorem, which we also prove in this article, states that the bound for 
the number of iterations of the SSP algorithm stated in Theorem [l] cannot be improved 
for 0 = H(n). 

Theorem 2. For given positive integers n, m G {n,... ,n 2 }, and 0 < 2 n there exists a 
minimum-cost flow network with 0(n) nodes, 0(m ) edges, and random edge costs with 
smoothing parameter 0 on which the SSP algorithm requires kl(m • min {n, 0} • 0) augmen¬ 
tation steps with probability 1. 


The main technical section of this article is devoted to the proof of Theorem [I] (Sec¬ 


tion [dj. In Section 
this article (Section 
analysis to the simp 


we derive the lower bound stated in Theorem [2j At the end of 
6), we point out some connections between SSP and its smoothed 
ex method with the shadow vertex pivot rule, which has been used 


by Spielman and Teng in their smoothed analysis 


1.1 The Minimum-Cost Flow Problem 

A flow network is a simple directed graph G = (V, E) together with a capacity func¬ 
tion u: E —» M>o- For convenience, we assume that there are no directed cycles of length 
two. In the minimum-cost flow problem there are an additional cost function c: E —> [0,1] 
and a balance function b: V —>• M indicating how much of a resource some node v re¬ 
quires (b(v) < 0) or offers (b(v) >0). A feasible b-flow for such an instance is a func¬ 
tion /: E —> M>o that obeys the capacity constraints 0 < f e < u e for any edge e € E and 
Kirchhoff’s law adapted to the balance values, i.e., b(v) + J2 e =(u,v)eE fe = J2 e '=(v,w)eE fe' 
for all nodes v E V. (Even though u, c, and / are functions, we use the notation u e , c e , 
and f e instead of u(e), c(e), and /(e) in this article.) If J2 v evK v ) 7^ 0, then there does 
not exist a feasible 6-flow. We therefore always require J2 v eV K v ) = 0- The cost of a 
feasible 6-flow is defined as c(/) = JfeeE fe ' °e■ In the minimum-cost flow problem the 
goal is to find the cheapest feasible 6-flow, a so-called minimum-cost b-flow, if one exists, 
and to output an error otherwise. 
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1.2 The SSP Algorithm 

For a pair e = (u, v), we denote by e _1 the pair (v, u ). Let G be a flow network, let c be 
a cost function, and let / be a flow. The residual network Gf is the directed graph with 
vertex set V, arc set E' = Ef U E^, where 

Ef = {e : e 6 E and f e < u e } 

is the set of so-called forward arcs and 

Eh = {e -1 : e E E and f e > 0} 

is the set of so-called backward arcs, a capacity function v !: E' —»• M, defined by 


u 


t 

e 


U e - fe if e E E , 
fe- 1 ife _1 G£\ 


and a cost function d: E' —> M, defined by 


c 


/ 

e 


c e if e £ -E , 

—c e -i if e _1 € E . 


In practice, the simplest way to implement the SSP algorithm is to transform the instance 
to an equivalent instance with only one supply node (a node with positive balance value) 
and one demand node (a node with negative balance value). For this, we add two nodes s 
and t to the network which we call master source and master sink , edges (s, v) for any 
supply node v, and edges (w, t ) for any demand node w. The capacities of these auxiliary 
edges ( s , v) and (w, t) are set to b(v) > 0 and —b(w) > 0, respectively. The costs of 
the auxiliary edges are set to 0. Now we set b(s) = — b(t ) = 2 where z is the sum of the 
capacities of the auxiliary edges incident with s (which is equal to the sum of the capacities 
of the auxiliary edges incident with t due to the assumption that J2 v &v^( v ) = 0)- All 
other balance values are set to 0. 

This is a well-known transformation of an arbitrary minimum-cost flow instance into 
a minimum-cost flow instance with only a single source s, a single sink t, and b{v) = 0 for 
all nodes v G V \ {s,f}. Nevertheless, we cannot assume without loss of generality that 
the flow network we study has only a single source and a single sink. The reason is that in 
the probabilistic input model introduced above it is not possible to insert auxiliary edges 
with costs 0 because the costs of each edge are chosen according to some density function 
that is bounded from above by <f>. We have to consider the auxiliary edges with costs 0 
explicitly and separately from the other edges in our analysis. 

The SSP algorithm run on the transformed instance computes the minimum-cost b- 
flow for the original instance. In the remainder of this article we use the term flow to 
refer to a feasible 6-flow for an arbitrary 6 with b(s) = — b(t ) and b(v) = 0 for v tf {s,f}. 
We will denote by |/| the amount of flow shipped from s to t in flow /, i.e., |/| = 

Se=(s,»)£E fe Ije=(»,s)£E /e - 

The SSP algorithm for a minimum-cost flow network with a single source s, a single 
sink t, and with b(s) = —6(f) = z > 0 is given as Algorithm [lj 
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Algorithm 1 SSP for single-source-single-sink minimum-cost flow networks with b(s) = 
—b(t ) = z > 0. 

1: start with the empty flow /o = 0 
2: for i = 1 , 2 ,... do 

3: if Gf z _ , does not contain a (directed) s-t path then output that there does not 

exist a flow with value 2 

4: find a shortest s-t path Pi in Gf i l with respect to the arc costs 

5: augment the flow as much as possible* along path Pi to obtain a new flow fi 

6: if \fi\ = z then output fi 

7: end for 

* Since the value \fi\ of flow fi must not exceed z and the flow fi must obey all capacity constraints, 
the flow is increased by the minimum of min{it e — fi- i(e) | e G P* D E}, min{/j_i(e) | e £ Pi A e"e 
E} and z - |/*i|. 


Theorem 3. In any round i, flow fi is a minimum-cost bi-flow for the balance function bi 
defined by bfls ) = —bflt) = \fi\ and bflv) = 0 for v fz {s,i}. 

Theorem[3]is due to Jewell M, Iri m, and Busacker and Gowen [5] • We refer to Korte 
and Vygen EH for a proof. As a consequence, no residual network Gf i contains a directed 
cycle with negative total costs. Otherwise, we could augment along such a cycle to obtain 
a bi- flow f with smaller costs than fi. In particular, this implies that the shortest paths 
in Gfi from s to nodes form a shortest path tree rooted at s. Since the choice of 

the value z only influences the last augmentation of the algorithm, the algorithm performs 
the same augmentations when run for two different values z\ < Z 2 until the flow value \fi\ 
exceeds z\. We will exploit this observation in Lemma [9j 

Note that one could allow the cost function c to have negative values as well. As long 
as the network does not contain a cycle with negative total costs, the SSP algorithm is 
still applicable. However, as we cannot ensure this property if the edge costs are random 
variables, we made the assumption that all edge costs are non-negative. 

1.3 A Connection to the Integer Worst-case Bound 

We can concentrate on counting the number of augmenting steps of the SSP algorithm since 
each step can be implemented to run in time 0(m + nlogn) using Dijkstra’s algorithm. 
Let us first consider the case that all edge costs are integers from {1,..., C} . In this 
case the length of any path in any residual network is bounded by nC. We will see that 
the lengths of the augmenting paths are monotonically increasing. If there is no unique 
shortest path to augment flow along and ties are broken by choosing one with the fewest 
number of arcs, then the number of successive augmenting paths with the same length 
is bounded by 0(mn ) (this follows from the analysis of the Edmonds-Karp algorithm for 
computing a maximum flow Hence, the SSP algorithm terminates within 0(mn 2 C) 
steps. 

Now let us perturb the edge costs of such an integral instance independently by, for 
example, uniform additive noise from the interval [—1,1]. This scenario is not covered by 
bounds for the integral case. Indeed, instances can be generated with positive probability 


5 







for which the number of augmentation steps is exponential in m and n. Nevertheless, an 
immediate consequence of Theorem[l]is that, in expectation, the SSP algorithm terminates 
within 0(mnC) steps on instances of this form. 

2 Terminology and Notation 

Consider the run of the SSP algorithm on the flow network G. We denote the set 
{/o,/i,...} of all flows encountered by the SSP algorithm by Eq(G). Furthermore, we 
set E(G) = Fq(G) \ {/o}. (We omit the parameter G if it is clear from the context.) 

Let us remark that we have not specified in Algorithm [l] which path is chosen if the 
shortest s-t path is not unique. This is not important for our analysis because we will see 
in Section [4] that this happens only with probability 0 in our probabilistic model. We can 
therefore assume Eq(G) to be well-defined. 

By /o and / max , we denote the empty flow and the maximum flow, i.e., the flow that 
assigns 0 to all edges e and the flow of maximum value encountered by the SSP algorithm, 
respectively. 

Let /j_i and fi be two consecutive flows encountered by the SSP algorithm and let Pj 
be the shortest path in the residual network Gf ^ i _ 1 , i.e., the SSP algorithm augments 
along Pi to increase flow /,_ i to obtain flow /). We call Pi the next path of fi- 1 and 
the previous path of /). To distinguish between the original network G and some residual 
network Gf in the remainder of this article, we refer to the edges in the residual network 
as arcs, whereas we refer to the edges in the original network as edges. 

For a given arc e in a residual network Gf, we denote by eo the corresponding edge 
in the original network G, i.e., eo = e if e E E (i.e. e is a forward arc) and eo = e^ 1 if 
e ^ E (i.e. e is a backward arc). An arc e is called empty (with respect to some residual 
network Gf) if e belongs to Gf, but e _1 does not. Empty arcs e are either forward arcs 
that do not carry flow or backward arcs whose corresponding edge eo carries as much flow 
as possible. We say that an arc becomes saturated (during an augmentation) when it is 
contained in the current augmenting path, but it does not belong to the residual network 
that we obtain after this augmentation. 

In the remainder, a path is always a simple directed path. Let P be a path, and 

P 

let u and v be contained in P in this order. By u -w v, we refer to the sub-path of P 

f — 

starting from node u going to node v, by P we refer to the path we obtain by reversing 
the direction of each edge of P. We call any flow network G' a possible residual network 
(of G ) if there is a flow / for G such that G' = Gf. Paths and cycles in possible residual 
networks are called possible paths and possible cycles, respectively. Let = (V, E L) E" 1 ) 
for E = {e^ 1 : e E E} denote the flow network that consists of all forward arcs and 
backward arcs. 

3 Outline of Our Approach 

Our analysis of the SSP algorithm is based on the following idea: We identify a flow fi e J-q 
with a real number by mapping fi to the length if of the previous path P % of fi. The flow /o 
is identified with Iq = 0. In this way, we obtain a sequence L = (£q, £\,...) of real numbers. 
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We show that this sequence is strictly monotonically increasing with probability 1. Since 
all costs are drawn from the interval [0,1], each element of L is from the interval [0, n]. To 
count the number of elements of L, we partition the interval [0, n] into small sub-intervals 
of length e and sum up the number of elements of L in these intervals. By linearity of 
expectation, this approach carries over to the expected number of elements of L. If e is 
very small, then - with sufficiently high probability - each interval contains at most one 
element. If this is the case then it suffices to bound the probability that an element of L 
falls into some interval (d, d + e\ because this probability equals the expected number of 
elements in (d, d + e]. 

To do so, we assume for the moment that there is an integer i such that li 6 {d, d + 
e]. By the previous assumption that for any interval of length e there is at most one 
path whose length is within this interval, we obtain that li-\ < d. We show that the 
augmenting path Pi uses an empty arc e. Moreover, we will see that we can reconstruct 
the flow fi-i and the path Pi without knowing the costs of edge eo that corresponds to 
arc e in the original network. This allows us to use the principle of deferred decisions: to 
bound the probability that li falls into the interval (d, d + e], we first reveal all costs c e / 
with e' ^ eo- Then Pi is known and its length, which equals li, can be expressed as a 
linear function k + c eo or k — c eg for a known constant k. Consequently, the probability 
that li falls into the interval (d, d + e] is bounded by ecj), as the probability density of c eo 
is bounded by </>. Since the arc e is not always the same, we have to apply a union bound 
over all 2m possible arcs. Summing up over all n/e intervals the expected number of flows 
encountered by the SSP algorithm can be bounded by roughly (n/e) • 2m ■ £(j) = 2 mncj). 

There are some parallels to the analysis of the smoothed number of Pareto-optinral 
solutions in bicriteria linear optimization problems by Beier and Vocking [3], although 
we have only one objective function. In this context, we would call /,; the loser, /,;_i the 
winner, and the difference li — d the loser gap. Beier and Vocking’s analysis is also based on 
the observation that the winner (which in their analysis is a Pareto-optimal solution and 
not a flow) can be reconstructed when all except for one random coefficients are revealed. 
While this reconstruction is simple in the setting of bicriteria optimization problems, the 
reconstruction of the flow /j_i in our setting is significantly more challenging and a main 
difficulty in our analysis. 

4 Proof of the Upper Bound 

Before we start with the analysis, note that due to our transformation of the general 
minimum-cost flow problem to a single-source-single-sink minimum-cost flow problem the 
cost perturbations only affect the original edges. The costs of the auxiliary edges are not 
perturbed but set to 0. Thus, we will slightly deviate from what we described in the 
outline by treating empty arcs corresponding to auxiliary edges separately. 

The SSP algorithm is in general not completely specified, since at some point during the 
run of the algorithm there could exist multiple shortest s-t paths in the residual network 
of the current flow. The SSP algorithm then allows any of them to be chosen as the next 
augmenting path. Due to Lemma [4] and Property [5] we can assume that this is not the 
case in our setting and that the SSP algorithm is completely specified. 
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Lemma 4. For any real e > 0 the probability that there are two nodes u and v and 
two distinct possible u-v paths whose lengths differ by at most e is bounded from above 
by 2 n 2n £(j). 

Proof. Fix two nodes u and v and two distinct possible u-v paths Pi and P 2 . Then there 
is an edge e such that one of the paths - without loss of generality path Pi - contains arc e 
or e^ 1 , but the other one does not. If we fix all edge costs except the cost of edge e, then 
the length of P 2 is already determined whereas the length of Pi depends on the cost c e . 
Hence, c e must fall into a fixed interval of length 2e in order for the path lengths of Pi 
and P 2 to differ by at most e. The probability for this is bounded by 2 e<f> because c e is 
chosen according to a density function that is bounded from above by (f>. A union bound 
over all pairs (u, v ) and all possible u-v paths concludes the proof. □ 

The proof also shows that we can assume that there is no s-t path of length 0 and 
according to Lemma [4] we can assume that the following property holds since it holds with 
a probability of 1. 

Property 5. For any nodes u and v the lengths of all possible u-v paths are pairwise 
distinct. 

Lemma 6. Let di{v) denote the distance from s to node v and d[{v) denote the distance 
from node v to t in the residual network Gj t . Then the sequences do(v),di(v),d 2 {v),... 
and d' 0 (v), dffv), d' 2 (v),... are monotonically increasing for every v G V. 

Proof. We only show the proof for the sequence do(v), d\(v), d 2 (v), <... The proof for 
the sequence d' 0 (v), d\ (v), d' 2 (v ),... can be shown analogously. Let i > 0 be an arbitrary 
integer. We show di(v) < di+i(v) by induction on the depth of node v in the shortest 
path tree Tj+i of the residual network Gf i+1 rooted at s. For the root s, the claim 
holds since di(s) = dj_|_i(s) = 0. Now assume that the claim holds for all nodes up 
to a certain depth k, consider a node v with depth k + 1, and let u denote its parent. 
Consequently, di+i(v) = di+i(u) + c e for e = (u,v). If arc e has been available in Gf t , 
then di(v) < di(u) + c e . If not, then the SSP algorithm must have augmented along e _1 in 
step * + 1 to obtain flow fi + \ and, hence, di(u) = di(v) + c e -1 = di(v) — c e . In both cases 
the inequality di(v) < di(u ) + c e holds. Applying the induction hypothesis for node u, we 
obtain di(v) < di(u) + c e < di+i(u) + c e = di+i(v). □ 

Definition 7. For a flow f E Fq, we denote by £_(fi) and d+(fi) the length of the previous 
path Pj and the next path P )+ i of f, respectively. By convention, we set l_(fo) = 0 and 
f’+C/max) = 00 . If the network G is clear from the context, then we simply write £_(fi) 
and £ + (fi). By we denote the cost function that maps reals x from the interval [0, |/ max |] 
to the cost of the cheapest flow f with value x, i.e., ^(x) = min(c(/) : |/| = x). 

The lengths £_(fi) correspond to the lengths £i mentioned in the outline. The apparent 
notational overhead is necessary for formal correctness. In Lemma [9j we will reveal a 
connection between the values £_(fi) and the function . Based on this, we can focus on 
analyzing the function ^. 

Lemma [6] implies in particular that the distance from the source s to the sink t is 
monotonically increasing, which yields the following corollary. 
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Corollary 8. Let fl, fj £ Fq be two flows with i < j. Then £_(fi) < (/-,). 

Lemma 9. The function T? is continuous, monotonically increasing, and piecewise linear, 
and the break points of the function are the values of the flows f £ Fq with £_(f) < •£,(/). 
For each flow f £ Fq, the slopes of to the left and to the right of \f\ equal £_(f) 
and £ + {f), respectively. 

Proof. The proof follows from Theorem [3] and the observation that the cost of the flow 
is linearly increasing when gradually increasing the flow along the shortest path in the 
residual network until at least one arc becomes saturated. The slope of the cost function 
is given by the length of that path. □ 


Example 10. Consider the flow network depicted in Figure\ 1} The cost c e and the capac¬ 
ity u e of an edge e are given by the notation c e ,u e . For each step of the SSP algorithm, 
Figure [3| lists the relevant part of the augmenting path (excluding s, s', t', and t), its 
length, the amount of flow that is sent along that path, and the arcs that become saturated. 
As can be seen in the table, the values \ f\ of the encountered flows f £ Fq are 0, 2, 3, 5, 
7, 10, and 12. These are the breakpoints of the cost function T>, and the lengths of the 
augmenting paths equal the slopes of T? (see Figure [^j. 


u 



Figure 1: Minimum-cost flow network with mas¬ 
ter source s and master sink t. 


c 



Figure 2: Cost function . 
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Figure 3: The augmenting paths for Example |10| 


With the following definition, we lay the foundation for distinguishing between original 
edges with perturbed costs and auxiliary edges whose costs are set to 0. 

Definition 11. Let f £ Fq be an arbitrary flow. An empty arc e in the residual network Gf 
that does not correspond to an auxiliary edge is called a good arc. We call f a good flow if 
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f fl fo an d if the previous path of f contains a good arc in the previous residual network. 
Otherwise, f is called a bad flow. 

Before we can derive a property of good arcs that are contained in the previous path 
of good flows, we need to show that for each flow value the minimum-cost flow is unique 
with probability 1 . 

Lemma 12. For any real e > 0 the probability that there exists a possible cycle whose 
costs lie in [ 0 , e] is bounded from above by 2 n 2n £(j). 

Proof. Assume that there exists a cycle K whose costs lie in [0,e]. Then K contains two 

i — 

nodes u and v and consists of a u-v path Pi and a v-u path P 2 . Then Pi and P 2 are two 

i — 

distinct u-v paths. Since K has costs in [0,e], the costs of Pi and P 2 differ by at most e. 
Now Lemma [4] concludes the proof. □ 

According to Lemma [12] we can assume that the following property holds since it holds 
with a probability of 1 . 


Property 13. There exists no possible cycle with costs 0. 


With Property 13 we can show that the minimum-cost flow is unique for each value. 


Lemma 14. For each value B G M>o there either exists no flow f with |/| = B or there 
exists a unique minimum-cost flow f with |/| = B. 

Proof. Assume that there exists a value B G M>o and two distinct minimum-cost flows / 
and f with |/| = \f'\ = B. Let Pa := {e G P | f e 7 ^ /'} be the set of edges on which / 
and f differ. We show in the following that the set Pa contains at least one undirected 
cycle I\. Since / and f are distinct flows, the set Pa cannot be empty. For v G V, let 
us denote by f-{y) = E e={u,v)&E fe the flow entering v and by f + (v) = E e =(v,w)eE fe 
the flow going out of v {f'_{v) and f+(v) are defined analogously). Flow conservation and 
|/| = | f'\ imply f~(v) — f_(v) = f+(v) — f' + {v) for all v G V. Now let us assume Pa does 
not contain an undirected cycle. In this case there must exist a vertex v G V with exactly 
one incident edge in Pa- We will show that this cannot happen. 

Assume f~(v) — fL(v) 0 for some v €V. Then the flows / and f differ on at least 
one edge e = (u,v) G P. Since this case implies f+(v) — f + {y) 0, they also differ on at 

least one edge e' = (v,w) G P and both these edges belong to Pa- It remains to consider 
nodes v G V with f-(v) — f'_{v) = f+(v) — f' + {v) = 0 and at least one incident edge in Pa- 
For such a node v there exists an edge e = (u,v) G P (or e = (v , w) G P) with f e /'. It 
follows Ee'=(«» 6 PeVe fe' ~ ?e' + 0 ( or E e '=( w y)eE,c'p fe' - f'e' + °) wh ich implies that 
there exists another edge e! = ( uv) e (or e = (v, w') e ) with f e i /',. 

For the flow f" = \f + \ f, which has the same costs as / and f and is hence a 
minimum-cost flow with | f"\ = B as well, we have f"{e) G (0,u e ) for all e G Pa- The 
flow f" can therefore be augmented in both directions along K. Due to Property 13 
augmenting f" in one of the two directions along K will result in a better flow. This is a 
contradiction. □ 
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Now we derive a property of good arcs that are contained in the previous path of good 
flows. This property allows us to bound the probability that one of the lengths £_(fi) falls 
into a given interval of length e. 

Lemma 15. Let f G Fq be a predecessor of a good flow for which £_(f) < £+(f) holds 
Additionally, let e be a good arc in the next path of f, and let eo be the edge in G that 
corresponds to e. Now change the cost of eo to c' eQ = 1 (d = 0) if eo = e (eo = e~ l ), 
i.e., when e is a forward (backward) arc. In any case, the cost of arc e increases. We 
denote the resulting flow network by G’. Then f E Fo(G'). Moreover, the inequalities 
t±{f) < £(/) < *?(/) < *?'(/) hold. 

Proof. Let 'if and c to' be the cost functions of the original network G and the modified 
network G' , respectively. Both functions are of the form described in Lemma [9] In 
particular, they are continuous and the breakpoints correspond to the values of the flows 
/ € Fq(G) and / € Fo(G') with £^(f) < ^(/) and £_ {f) < £%(/), respectively. 

We start with analyzing the case eo = e. In this case, we set = TP and observe 
that increasing the cost of edge eo to 1 cannot decrease the cost of any flow in G. Hence, 
'if" > 'if. Since flow / does not use arc e, its costs remain unchanged, i.e., < if // (|/|) = ^(|/|). 

If eo = e -1 , then we set c €" = % Ji + A eo for A eo = u eo ■ c eo . This function is also 
piecewise linear and has the same breakpoints and slopes as < *? / . Since the flow on edge eo 
cannot exceed the capacity u eo of edge eo and since the cost on that edge has been reduced 
by c eg in G' , the cost of each flow is reduced by at most A eo in G'. Furthermore, this 
gain is only achieved for flows that entirely use edge eo like / does. Hence, T>" > ^ and 

^"(l/l) = tffl/D- 


c 



I/I 


Figure 4: Cost function and function c £ h 


Due to c €" > T?, c ^"{\f\) = ^(|/|), and the form of both functions, the left-hand 
derivative of at |/| is at most the left-hand derivative of ^ at |/| (see Figure |d]). 
Since |/| is a breakpoint of this implies that |/| is also a breakpoint of and that 
the slope of c €" to the left of |/| is at most the slope of 'if to the left of |/|. For the same 
reasons, the right-hand derivative of T>" at |/| is at least the right-hand derivative of c € 
at |/| and the slope of c €" to the right of |/| is at least the slope of ‘if to the right of |/|. 
These properties carry over to < tf'. Hence, Fq(G 1 ) contains a flow f with \f'\ = |/|. Since 
/ is a minimum-cost flow with respect to c, f is a minimum-cost flow with respect to d, 


we have d(f) = c(f) and d(f*) > c(f*) for all possible flows /*, Lemma 14 yields f = f 
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Algorithm 2 Reconstruct^, d). 

1: let eo be the edge that corresponds to arc e in the original network G 
2: change the cost of edge eo to d e = 1 if e is a forward arc or to c' eQ = 0 if e is a backward 
arc 

3: start running the SSP algorithm on the modified network G' 

4: stop when the length of the shortest s-t path in the residual network of the current 
flow f exceeds d 
5: output f 


and therefore / £ Fq{G'). Recalling the fact that the slopes correspond to shortest s-t 
path lengths, the stated chain of inequalities follows. □ 


Lemma 15 suggests Algorithm^ (Reconstruct) for reconstructing a flow / based on a 
good arc e that belongs to the shortest path in the residual network Gf and on a threshold 
d £ [£_(/),£ flf)). The crucial fact that we will later exploit is that for this reconstruction 
the cost c eo of edge eo does not have to be known. (Note that we only need Reconstruct 
for the analysis in order to show that the flow / can be reconstructed.) 


Corollary 16. Let f £ Pq be a predecessor of a good flow, let e be a good arc in the next 
path of f, and let d £ [£_(f),£ + (f)) be a real number. Then Reconstruct^, d) outputs 
flow f. 


Proof. By applying Lemma 


15 


we obtain / £ Pq(G') and £p_ (/) < d < £+ (/). Together 
with Corollary [8j this implies that Reconstruct^, d) does not stop before encountering 
flow / and stops once it encounters /. Hence, Reconstruct^, d) outputs flow /. □ 


Corollary 1 16| is an essential component of the proof of Theorem [l] but it only describes 
how to reconstruct predecessor flows / of good flows with £_(f) < £ + (f)- In the next part 
of this section we show that most of the flows are good flows and that, with a probability 
of 1, the inequality £_(f) < £+(f) holds for any flow / £ J-q. 


Lemma 17. In any step of the SSP algorithm, any s-t path in the residual network 
contains at least one empty arc. 


Proof. The claim is true for the empty flow /o- Now consider a flow f) £ T, its predecessor 
flow fi-i, the path Pi, which is a shortest path in the residual network and an 

arbitrary s-t path P in the current residual network Gf i . We show that at least one arc 
in P is empty. 

For this, fix one arc e = (x, y) from Pi that is not contained in the current residual 
network Gf t since it became saturated by the augmentation along Pi. Let v be the first 

P 

node of P that occurs in the sub-path y t of Pi, and let u be the last node in the 
sub-path s v of P that belongs to the sub-path s 5* x of Pi (see Figure [ 5 ]). By the 

p 

choice of u and v, all nodes on the sub-path P' = u v of P except u and v do not 
belong to Pi. Hence, the arcs of P' are also available in the residual network Gf i _ 1 and 
have the same capacity in both residual networks Gf._ 1 and Gf i . 
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Figure 5: Paths P and Pi in the residual network Gf t . 


In the remainder of this proof, we show that at least one arc of P' is empty. Assume 

to the contrary that none of the arcs is empty in Gf i and, hence, in Gf i _ 1 . This implies 

that, for each arc e E P' , the residual network Gf i _ 1 also contains the arc e^ 1 . Since Pi is 

the shortest s-t path in Gf i _ 1 and since the lengths of all possible s-t paths are pairwise 

P'PP' P 

distinct, the path s u v t is longer than Pi. Consequently, the path P' = u v is 

p 

longer than the path u v. This contradicts the fact that flow /*_ i is optimal since the 
arcs of path u v combined with the reverse arcs e^ 1 of all the arcs e of path P' form a 
directed cycle C in Gf i _ l of negative costs. O 

We want to partition the interval [0, n] into small sub-intervals of length e and treat the 
number of lengths £_(fi) that fall into a given sub-interval as a binary random variable. 
This may be wrong if there are two possible s-t paths whose lengths differ by at most e. In 
this case whose probability tends to 0 (see Lemma [4]) we will simply bound the number of 
augmentation steps of the SSP algorithm by a worst-case bound according to the following 
lemma. 

Lemma 18. The number |J-o| °.f flows encountered by the SSP algorithm is bounded 
by 3 m+n . 

Proof. We call two possible residual networks equivalent if they contain the same arcs. 
Equivalent possible residual networks have the same shortest s-t path in common. The 
length of this path is also the same. Assume that for two distinct flows fl, fj E Pi o with 
i < j, the residual networks Gf t and Gf. are equivalent. We then have £_(fip±) = £p(fi) 
= £+(fj) = £-{fj+ 1) and due to Corollary [8j £_{f i+ 1 ) = £_(fk) = t-(fj+ 1) for all i < 
k < j + 1. Property [5] then implies P t p\ = Pk for all i < k < j + 1 and especially 
Pipi = Pip 2 , which is a contradiction. Therefore the number of equivalence classes is 
bounded by 3 m+n since there are m original edges and at most n auxiliary edges. This 
completes the proof. □ 


Lemma 19. There are at most n bad flows f E P. 


Proof. According to Lemma 17, the augmenting path contains an empty arc e in each step. 
If e is an arc that corresponds to an auxiliary edge (this is the only case when e is not a 
good arc), then e is not empty after the augmentation. Since the SSP algorithm does not 
augment along arcs e~ l if e is an arc that corresponds to an auxiliary edge, non-empty 
arcs that correspond to auxiliary edges cannot be empty a second time. Thus, there can 
be at most n steps where the augmenting path does not contain a good arc. This implies 
that there are at most n bad flows / E P. □ 
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We can now bound the probability that there is a flow /;£ J whose previous path’s 
length £_(fi) falls into a given sub-interval of length e. Though we count bad flows 
separately, they also play a role in bounding the probability that there is a good flow fl G T 
such that £_(fi) falls into a given sub-interval of length e. 


Lemma 20. For a fixed real d > 0, let E^ )£ be the event that there is a flow f G T 
for which £_(f) G (d, d + e\, and let B d , £ be the event that there is a bad flow f G F 
for which £_(f) G (d, d + e\. Then the probability of E^ e can be bounded by Pr [Ed,e] < 
2me<f + 2 • Pr[B d , e }. 


Proof. Let A^ e be the event that there is a good flow / G T for which £_(/) G (d,d + e]. 
Since E^ e = A dt£ UB d£ , it suffices to show that Pr [A dt£ ] < 2ms(j)+Pr [B^ e \. Consider the 
event that there is a good flow whose previous path’s length lies in the interval (d, d + e]. 
Among all these good flows, let / be the one with the smallest value £_(f), i.e., / is the 
first good flow / encountered by the SSP algorithm for which £_(f) G (d, d + e], and 
let f* be its previous flow. Flow /* always exists since f cannot be the empty flow /o- 
Corollary |i] and Property [H] yield £_{f*) < •£_(/). Thus, there can only be two cases: If 
£_(f*) G (d,d + e], then f* is a bad flow by the choice of f and, hence, event B de occurs. 
The interesting case, which we consider now, is when £_(f*) < d holds. If this is true, 
then d G [£_{f*),£ + (f*)) due to £ + (f*) = £-{f)- 

As / is a good flow, the shortest path in the residual network Gf* contains a good arc 
e = (u,v). Applying Corollary 16 we obtain that we can reconstruct flow f* by calling 
Reconstruct^, d). The shortest s-t path P in the residual network Gf* is the previous path 

of / and its length equals £_(f). Furthermore, P is of the form s u —>• v t, where 
p p 

s u and v t are shortest paths in Gf* from s to u and from v to t. respectively. 
These observations yield 


Ad,s G Re } d,e U Ft e -i d y£ U Bd,e , 

e£E e£E 


where R e ^,e for some arc e = (u,v) denotes the following event: The event R et d,e occurs 
if £ G (d,d + s], where l is the length of the shortest s-t path that uses arc e in Gf, 
the residual network of the flow / obtained by calling the procedure Reconstruct^, d). 
Therefore, the probability of event Ad t£ is bounded by 


£ Pr [R eA£ ] + E Pr 

e£E eS E 



+ Pr [B d>e ] . 


We conclude the proof by showing Pr [R e ,d,e] < £<t>- For this, let eo be the edge corre¬ 
sponding to arc e = (u, v ) in the original network. If we fix all edge costs except cost c eo 
of edge eo, then the output / of Reconstruct^, d) is already determined. The same holds 
for the shortest s-t path in Gf that uses arc e since it is of the form s u —>• v t where 
P\ = s u is a shortest s-u path in Gf that does not use v and where P 2 = v t is a 
shortest v-t path in Gf that does not use u. The length £ of this path, however, depends 
linearly on the cost c eo . To be more precise, £ = £' + c e = £' + sgn(e) • c eo , where £' is the 
length of P\ plus the length of P 2 and where 


sgn(e) 


+1 if e 0 = e , 

—1 if eo = e -1 . 
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Hence, t falls into the interval (d, d + e] if and only if c eo falls into some fixed interval 
of length e. The probability for this is bounded by £(f> as c eo is drawn according to a 
distribution whose density is bounded by (f>. □ 


Corollary 21. The expected number of augmentation steps the SSP algorithm performs 
is bounded by 2 mn<f> + 2 n. 

Proof. Let X = | F\ be the number of augmentation steps of the SSP algorithm. For reals 
d, e > 0, let Ed.e and Bd t£ be the events defined in Lemma 20, let X^ £ be the number of 
flows / £ L for which l_(f) G (d,d + e], and let Zd }£ = min {X^ e , 1} be the indicator 
variable of event E^ £ . 

Since all costs are drawn from the interval [0,1], the length of any possible s-t path is 
bounded by n. Furthermore, according to Corollary |8j all lengths are non-negative (and 
positive with a probability of 1). Let F e denote the event that there are two possible s-t 
paths whose lengths differ by at most e. Then, for any positive integer k, we obtain 



< 3 m + n 


if Fn does not occur , 

k ’ 

if Fn. occurs. 

k 


Consequently, 


fc-i 


E[X]<^E 

i =0 
k-1 

= E Pr 


i=0 


Zj.n. n 


E j.n. s 

k ’ k 


k-1 


+ 3 


m+n 


Pr 


3 m+ n . p r 


p' n 


< 2 mruj) + 2 • 


Bj.n. il 

1 k ’ k 


+ 3 


m+n 


i =0 


< 2 mn<j> + 2n + 3 


m+n 


Pr 


Pr 


The second inequality is due to Lemma 20 whereas the third inequality stems from 
Lemma 19 The claim follows since Pr Fn —> 0 for k 


Lemma IT 


oo in accordance with 
□ 


Now we are almost done with the proof of our main theorem. 

Proof. Since each step of the SSP algorithm runs in time 0(m + nlogn) using Dijkstra’s 
algorithm (see, e.g., Korte m for details), applying Corollary |2l| yields the desired result. 

n 


5 Proof of the Lower Bound 

This section is devoted to the proof of Theorem [2] For given positive integers n, m G 
(to, ..., n 2 }, and 4> < 2 n let k = [log 2 4>\ — 5 = 0(n) and M = min jn, 2 L lo S 2 <AJ /4 — 2| = 
0(min{n, 4>}). In the following we assume that 4> > 64, such that we have k,M > 1. If 


15 























0 < 64, the lower bound on the number of augmentation steps from Theorem [2] reduces to 
Q(m) and a simple flow network like the network G\, as explained below, which we will 
use as initial network in case 0 > 64, with 0(n ) nodes, 0{m) edges, and uniform edge 
costs proves the lower bound. 

We construct a flow network with 2n + 2k + 2 + 4 M = 0(n) nodes, m + 2n + 4k — 
4 + 8 M = 0(m) edges, and smoothing parameter 0 on which the SSP algorithm requires 
m ■ 2 k ~ 1 ■ 2M = Q(m ■ cj) ■ min{n, 0}) augmentation steps in expectation. To be exact, we 
show that for any realization of the edge costs for which there do not exist multiple paths 
with exactly the same costs (Property [5]) the SSP algorithm requires that many iterations. 
Since this happens with probability 1, we will assume in the following that Property [5] 
holds without further mention. 

For the sake of simplicity we consider edge cost densities f e : [0,^>] —>• [0,1] instead 
of f e : [0,1] —>• [0,0]. This is an equivalent smoothed input model because both types of 
densities can be transformed into each other by scaling by a factor of 0 and because the 
behavior of the SSP algorithm is invariant under scaling of the edge costs. Furthermore, 
our densities f e will be uniform distributions on intervals I e with lengths of at least 1. In 
the remainder of this section we only construct these intervals I e . Also, all minimum-cost 
flow networks constructed in this section have a unique source node s and a unique sink 
node t. which is always clear from the context. The balance values of the nodes are defined 
as b(v) = 0 for all nodes v ^ {s,f} and — b(t ) = b(s) = Yle=(s,v) u z = J2e=(w,t) u ei that is> 
each 6-flow equals a maximum s-f-flow. 

The construction of the desired minimum-cost flow network G consists of three steps, 
which we sketch below and describe in more detail thereafter. Given Property [5] our 
choice of distributions for the edge costs ensures that the behavior of the SSP algorithm 
is the same for every realization of the edge costs. 

1. In the first step we define a simple flow network G± with a source si and a sink t\ 
on which the SSP algorithm requires m augmentation steps. 

2. In the second step we take a flow network Gi, starting with i = 1, as the basis for 
constructing a larger flow network Gi +We obtain the new flow network by adding 
a new source Sj+i, a new sink and four edges connecting the new source and 
sink with the old source and sink. Additionally, the latter two nodes are downgraded 
to “normal” nodes (nodes with a balance value of 0) in Gi +1 (see Figure [7]). By a 
careful choice of the new capacities and cost intervals we can ensure the following 
property: First, the SSP algorithm subsequently augments along all paths of the 
form 

Sj+l ^ ti ^ ti -|_i , 

where P is an path encountered by the SSP algorithm when run on the net¬ 
work Gi. Then, it augments along all paths of the form 

■<— 

®t+l t ti Si )■ ti-y .i, 

where P is again an Si~t t path encountered by the SSP algorithm when run on the 
network Gi. Hence, by adding two nodes and four edges we double the number 
of iterations the SSP algorithm requires. For this construction to work we have to 
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double the maximum edge cost of our flow network. Hence, this construction can 
be repeated k — 1 ~ log<^ times, yielding an additional factor of 2 fc “ 1 ~ (f> for the 
number of iterations required by the SSP algorithm. 

3. In the third step we add a global source s and a global sink t to the flow network Gk 
constructed in the second step, and add four directed paths of length M « min{n, </>}, 
where each contains M new nodes and has exactly one node in common with G^. 
The first path will end in Sk, the second path will end in tk, the third path will start 
in Sk, and the fourth path will start in tk- We will also add an arc from s to every new 
node in the first two paths and an arc from every new node in the last two paths to t 
(see Figure [8J. We call the resulting flow network G. By the right choice of the edge 
costs and capacities we will ensure that for each Sk-tk path P in Gk encountered by 
the SSP algorithm on Gk the SSP algorithm on G encounters M augmenting paths 

■i — 

having P as a sub-path and M augmenting paths having P as a sub-path. In this 
way, we gain an additional factor of 2 M for the number of iterations of the SSP 
algorithm. 

In the following we say that the SSP algorithm encounters a path P on a flow net¬ 
work G' if it augments along P when run on G'. 

Construction of G\. For the first step, consider two sets U = {iti,... ,u n } and W = 
{w \,..., w n } of n nodes and an arbitrary set Euw QUxW containing exactly \Euw\ = m 
edges. The initial flow network G\ is defined as G\ = for V\ = U U W U {si,ti} 

and 

Ei = ({si} x U) U E uw U (IF x {t\ }) . 

The edges e from Euw have capacity 1 and costs from the interval I e = [7,9]. The edges 
(s\,Ui),Ui £ U have a capacity equal to the out-degree of Ui, the edges (wj,ti),Wj E W 
have a capacity equal to the in-degree of Wj and both have costs from the interval I e = [ 0 , 1] 
(see Figure [b]). (Remember that we use uniform distributions on the intervals I e .) 

G i Euw 

U\ W 1 



Figure 6: Example for G\ with n = 3 and m = 7 with capacities different from 1 shown 
next to the edges and the cost intervals shown below each edge set. 
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Lemma 22. The SSP algorithm requires exactly m iterations on G± to obtain a max¬ 
imum s i -t\ -flow. Furthermore all augmenting paths it encounters have costs from the 
interval [7,11]. 

Proof. First we observe that the SSP algorithm augments only along paths that are of the 
form si — >• Ui — >• Wj —>• t\ for some Ui E U and Wj E W: Consider an arbitrary augmenting 
path P the SSP algorithm encounters and assume for contradiction that P is not of this 
form. Due to the structure of G\. the first two edges of P are of the form (si,Ui) and 
(■ Ui,Wj ) for some Ui E U and Wj E W. The choice of the capacities ensures that the 
edge (wj,t i) cannot be fully saturated if the edge ( Ui,Wj ) is not. Hence, when the SSP 

algorithm augments along P, the edge (■ Wj,t\ ) is available in the residual network. Since 

P 

this edge is not used by the SSP algorithm, the sub-path Wj t\ has smaller costs than 
the edge (wj,ti). This means that the distance of Wj to the sink t\ in the current residual 
network is smaller than in the initial residual network for the zero flow. This contradicts 
Lemma [6] 

Since every path the SSP algorithm encounters on G± is of the form si —>• m —>• Wj t\, 
every such path consists of two edges with costs from the interval [0,1] and one edge with 
costs from the interval [7, 9]. This implies that the total costs of any such path lie in the 
interval [7,11]. 

The choice of capacities ensures that on every augmenting path of the form si —> Ui —> 
Wj —> t.\ the edge (itj, Wj) is a bottleneck and becomes saturated by the augmentation. As 
flow is never removed from this edge again, there is a one-to-one correspondence between 
the paths the SSP algorithm encounters on G i and the edges from Euw This implies 
that the SSP algorithm encounters exactly rn paths on Gj. □ 

Construction of Gj+i from Gi. Now we describe the second step of our construction 
more formally. Given a flow network Gi = (Vi, Ef) with a source s, and a sink ti, we define 
G i+ 1 = (V i+1 ,E i+ i), where V i+1 = V U {s i+ i,t i+1 } and 

Ei-\- 1 — Ei U X U X . 

Let Ni = 2* _1 • m, which is the value of the maximum Si-t t flow in Gi. The new edges 

e E {(sj+i, sf), (ti, tj+i)} have capacity u e = Ni and costs from the interval I e = [0,1]. 
The new edges e E {(sj+i, tf), (si, tj+i)} also have capacity u e = Ni, but costs from the 
interval I e = [2* +3 — 1,2* +3 + 1] (see Figure [7]). 

Next we analyze how many iterations the SSP algorithm requires to reach a maximum 
Si+i-tj+i flow in Gj+i when run on the network Gj+i. Before we can start with this 
analysis, we prove the following property of the SSP algorithm. 

Lemma 23. After augmenting flow via a cheapest v-w-path P in a network without a 

i .— 

cycle with negative total costs, P is a cheapest w-v-path. 

i — i — 

Proof. Since we augmented along P, all edges of P will be part of the residual network. P 
will therefore be a feasible tc-u-path. Assume that after augmenting along P there exists 

f — 

a rc-u-path P that is cheaper than p. Let us take a look at the multi-set X = P U P', 
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Figure 7: G l+ \ with Gi as sub-graph with edge costs next to the edges. 


which contains every arc eCPflP' twice. The total costs of this multi-set are negative 
because 

c(P) + c{P') = -c(P) + c(P') < 0 

by the assumption that P’ is cheaper than P. Furthermore, for each node the number of 
incoming and outgoing arcs from X is the same. This property is preserved if we delete 
all pairs of a forward arc e and the corresponding backward arc e _1 from X , resulting in 
a multi-set X' Cl. The total costs of the arcs in X 1 are negative because they equal the 
total costs of the arcs in X. 

For every arc e £ X that did not have positive residual capacity before augmenting 
along P , the arc e -1 must be part of P and therefore be part of X as well. This is due to 
the fact that only for arcs e with e -1 E P the residual capacity increases when augmenting 
along P. Since all such pairs of arcs are deleted, the set X' will only contain arcs that 
had a positive residual capacity before augmenting along P. Since each node has the same 
number of outgoing and incoming arcs from X ', we can partition X' into subsets, where 
the arcs in each subset form a cycle. Since the total costs of all arcs are negative at least 
one of these cycles has to have negative costs, which is a contradiction. □ 

Since during the execution of the SSP algorithm all residual networks have conservative 
costs on the arcs, Lemma [23] always applies. 

Lemma 24. Let i > 1. All Si-ti-paths the SSP algorithm encounters when run on the 
network Gi have costs from the interval [7,2* +3 — 5]. Furthermore the SSP algorithm 
encounters on the network G l+ \ twice as many paths as on the network Gi. 

Proof. We prove the first half of the lemma by induction over i. In accordance with 
Lemma [22] all paths the SSP algorithm encounters on G\ have costs from the inter¬ 
val [7,11] = [7,2 4 — 5]. 

Now assume that all paths the SSP algorithm encounters in Gi, for some i > 1, have 
costs from the interval [7, 2* +3 — 5]. We distinguish between three different kinds of Sj+i- 
L + i-paths in Gi + \. 

Definition 25. We classify the possible Sj+i -L+i -paths P in G l+ \ as follows. 

1. If P = Sj+i -4 Si ti —>• ti- |_i, then P is called a type-l-path. 
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2. If P = s i+ 1 

3. If P = s i+ 1 
For any type-2-path P we have 


Si —>• ti +1 or P = Sj_|_i —y ti —> ti- |_i, then P is called a type-2-path. 
ti Si —^ |_i, i/ien P is called a type-3-path. 


c(P) G [0 + (2*+ 3 - 1), 1 + (2 l+i + 1)] = [2*+ 3 - 1,2 l+3 + 2] C [7,2 l+4 - 5] . 

Since due to Lemma [6] the distance from ti to t, t +\ does not decrease during the run of 
the SSP algorithm, the SSP algorithm will only augment along a type-3-path P once the 
edge (U,ti- |_i) is saturated. Otherwise the tj-tj+i-sub-path of P could be replaced by the 
edge (U,ti- |_i) to create a cheaper path. Once the edge 1 ) has been saturated, the 

SSP algorithm cannot augment along type-l-paths anymore. Therefore, the SSP algorithm 
will augment along all type-l-paths it encounters before it augments along all type-3-paths 
it encounters. 

Since during the time the SSP algorithm augments along type-l-paths no other aug¬ 
mentations alter the part of the residual network corresponding to Gi, the correspond¬ 
ing sub-paths P' are paths in Gi that the SSP algorithm encounters when run on the 
network Gi. Using the induction hypothesis, this yields that all type-l-paths the SSP 
algorithm encounters have costs from the interval 

[0 + 7 + 0,1 + (2* +3 - 5) + 1] = [7, 2* +3 - 3] C [7, 2 i+4 - 5] . 

Since all of these type-l-paths have less costs than the two type-2-paths, the SSP algorithm 
will augment along them as long as there still exists an augmenting Sj-tj-sub-path P'. Due 
to the choice of capacities this is the case until both edges (sj+i,Sj) and (U,ti+ 1 ) are 
saturated. Therefore, the SSP algorithm will not augment along any type-2-path. 

When analyzing the costs of type-3-paths, we have to look at the fj-Sj-sub-paths. Let 
l be the number of Si-L-paths the SSP algorithm encounters when run on the network Gi 
and let P\, P 2 ,..., Pi be the corresponding paths in the same order, in which they were 
encountered. Then Lemmagyields that for any j e {1, after augmenting along the 

paths P\. P‘ 2 - ■ ■ ■ ■ Pj the cheapest ti-Si -path in the residual network is Pj. Property myields 
that it is the only cheapest path. Also the residual network we obtain, if we then augment 

i — 

via Pj is equal to the residual network obtained, when only augmenting along the paths 
Pi, P 2 ,, Pj- 1 . Starting with j = I this yields that the L-Sj-sub-paths corresponding to 

ff — 

the type-3-paths the SSP algorithm encounters are equal to Pp...., P\. By induction the 
cost of each such path Pj lies in [7, 2 l+3 — 5]. This yields that every type-3-path the SSP 
algorithm encounters has costs from the interval 

[(2 i+3 - 1) - (2 i+3 - 5) + (2 i+3 - 1), (2* +3 + 1) - 7 + (2* +3 + 1)] 

= [2* +3 + 3,2* +4 - 5] C [7,2 i+4 - 5] . 

The previous argument also shows that the SSP algorithm encounters on Gi +1 twice 
as many paths as on Gi because it encounters £ type-l-paths, no type-2-path, and l type- 
3-paths, where £ denotes the number of paths the SSP algorithm encounters on Gi. □ 

Since the SSP algorithm augments along m paths when run on the network G\, it will 
augment along 2* _1 • m paths when run on the network Gi. Note, that at the end of the 
SSP algorithm, when run on Gi for i > 1, only the 4 arcs incident to s, and ti carry flow. 
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Construction of G from Gk• Let Nk = 2 k 1 • m, which is the value of a maximum 
Sk-tk flow in Gk- We will now use Gk to define G = (V, E) as follows (see also Figure [8J. 

• V := Vk U A U B U C U D U {s, t}, with A := {ai, a 2 ,..., aw}, B := {b\, b 2 ,..., 6m}, 
c : = (ci,C 2 ,... ,c M }, and D := {di,d 2 ,... ,d M }- E := E k U E a U E b U E c U E d . 

• E 0 contains the edges (a;,aj_ 1 ), i G (2,..., M}, with cost interval [2 fc + 5 — l,2 fc+5 ] 

and infinite capacity, (s,aj), * G with cost interval [0,1] and capacity 

Nk, and (ai,Sfc) with cost interval [2 fc+4 — 1, 2 fc+4 ] and infinite capacity. 

• Eb contains the edges (fo®, _ 1 ), i G {2, with cost interval [2 fc+5 — l,2 fc+5 ] 

and infinite capacity, ( s,bi ), i G {1 with cost interval [0,1] and capacity 

IV*,, and (b\,tk) with cost interval [2 fc_,_5 — l,2 fe+5 ] and infinite capacity. 

• E c contains the edges (cj_i,Cj), i G {2, ...,M}, with cost interval [2 fc+5 — 1,2 fc+5 ] 
and infinite capacity, + ,+ i G {1,...,M}, with cost interval [0,1] and capacity 
Nk, and (s k ,ci) with cost interval [2 fc+5 — l,2 fc+5 ] and infinite capacity. 

• Ed contains the edges (dj-i, di), i G {2,..., M}, with cost interval [2 fc+5 — 1, 2 fc+5 ] 
and infinite capacity, (di, t ), i G {1,..., ?n}, with cost interval [0,1] and capacity N k , 
and (tk,d\) with cost interval [2 fc+4 — l,2 fc+4 ] and infinite capacity. 

capacity 00 
capacity F 


capacity F 
capacity 00 

Figure 8: G with G k as sub-graph with approximate edge costs on the edges. A value c 
below an edge e means the the cost of e is drawn uniformly at random from the interval 
[c- l,c]. 


9/0+5 9/0+5 o/cH -5 9/0+5 9/0+4 9/0+5 9/0+5 9/0+5 9/0+5 9/0+5 

—> a 4 —> a 3 Z —> a 2 Z —> a 4 —► —> c \-—> c 2 Z —> c 3 Z —> c 4 Z —> c 5 



65 -> 64 -> 63 -» 62 -* b\ -> tk -> d\ -> C?2 -> C?3 -> C?4 -> C?5 

2^+5 2 /c+5 2^+^ 2^4“^ 2^ c- i _ 4 2^+^ 2 fc+5 2^^"^ 


Theorem 26. The SSP algorithm encounters m ■ 2 k 1 • 2 M paths on the network G. 

Proof. We categorize the different -s-t-paths the SSP algorithm encounters on G by the 
node after s and the node before t. Each such s-t-path can be described as an {aj,c+, 
{ai,dj}-, {bi,Cj}~, or {+ d+path for some i, j G {1,..., M}. 
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All Sk~tk ~paths encountered by the SSP algorithm, when run on Gk, have costs from 
the interval [7, 2 fc+3 — 5] in accordance with Lemma 24 For any i £ {!,..., m}, the costs 


of the s-aj-Sfc-path and the tk-di-t- path lie in [ai,oti + (i +1 )] with on = 2 k+5 i — 2 fc+4 — i and 
the costs of the s-bi-tk- path and the Sk-Ci-t -path lie in [/%, /?«+ (i + 1)] with = 2 k+5 i — i. 
Furthermore i < M + 1 < 2 fc+3 . 

Therefore, the SSP algorithm will only augment along {a*, Cj}-paths if no {a*, c/j}-paths 
are available. Also, any {ai,di}- path is shorter than any {6j,Cj}-path and any {bi,Ci}- 
path is shorter than any {aj+i, dj+i}-path. Finally, any {bi, Cj}-path is shorter than any 
{aj+i, Cj}-path or {6j, dj+i}-path. Therefore, the SSP algorithm will start with augment¬ 
ing along {ai, di}-paths. After augmenting along {a,, dj}-paths it will augment along 
{bi, Cj}-paths and after augmenting along {bi, Cj}-paths it will augment along {oj+i, dj+i}- 
paths. Due to the choice of the capacities we can see that once the SSP algorithm starts 
augmenting along an {aj,dj}-path it keeps augmenting along {at, dj}-paths until there is 
no Sfc-tfc-path in the residual network that lies completely in the sub-network correspond¬ 
ing to Gk- Also, once the SSP algorithm starts augmenting along an {bi,Ci }-path it keeps 
augmenting along {bi, Cj}-paths until there is no tk-Sk- path in the residual network that lies 
completely in the sub-network corresponding to Gk- After the SSP algorithm augmented 
along the last {a*, dj}-path the residual network in the sub-network corresponding to Gk 
is equal to the residual network of a maximum flow in Gk- After the SSP algorithm aug¬ 
mented along the last {&*, q}- path the residual network in the sub-network corresponding 
to Gk is equal to Gk- We can see that the SSP algorithm augments along an {a*, dj}-path 
for every path P it encounters on Gk and along an {6j,Cj}-path for the backwards path 

i — 

P of every path P it encounters on Gk- Therefore, the SSP-algorithm will augment M 
times along paths corresponding to the paths it encounters on Gk and M times along 
paths corresponding to the backward paths of these paths and therefore augment along 
2.4/ times as many paths in G as in Gk- □ 


To show that G contains 2n + 2fc + 2 + AM nodes and m + 2n + Ak — 4 + 8 M edges, we 
observe that G\ has 2n + 2 nodes and m + 2n edges, the k — 1 iterations to create Gk add 
a total of 2k — 2 nodes and Ak — A edges and the construction of G from Gk adds AM + 2 
nodes and 8 M edges. This gives a total of 2n + 2 + 2k — 2 + AM + 2 = 2n + 2k + 2 + AM 
nodes and m. + 2n + Ak — A + 8M edges. Since k,M = 0{n) and m>n,G has 0(n ) nodes 
and 0{m ) edges and forces the SSP algorithm to encounter m ■ 2 k i ■ 2M = £l(rncj)M ) = 
0(0 • m ■ min(0, n)) paths on G. For 0 = 0(n) this lower bound shows that the upper 
bound of O(mn0) augmentation steps in Theorem [I] is tight. 


6 Smoothed Analysis of the Simplex Algorithm 

In this section we describe a surprising connection between our result about the SSP 
algorithm and the smoothed analysis of the simplex algorithm. Spielman and Teng’s 
original smoothed analysis [23] as well as Vershynin’s [26] improved analysis are based on 
the shadow vertex method. To describe this pivot rule, let us consider a linear program 
with an objective function z T x and a set of constraints Ax < b. Let us assume that a non- 
optimal initial vertex xq of the polytope P of feasible solutions is given. The shadow vertex 
method computes an objective function u T x that is optimized by xq. Then it projects the 
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polytope P onto the 2-dimensional plane that is spanned by the vectors z and u. If we 
assume for the sake of simplicity that P is bounded, then the resulting projection is a 
polygon Q. 

The crucial properties of the polygon Q are as follows: both the projection of xq and the 
projection of the optimal solution x* are vertices of Q, and every edge of Q corresponds to 
an edge of P. The shadow vertex method follows the edges of Q from the projection of xo 
to the projection of x*. The aforementioned properties guarantee that this corresponds to 
a feasible walk on the polytope P. 

To relate the shadow vertex method and the SSP algorithm, we consider the canonical 
linear program for the maximum-flow problem with one source and one sink. In this linear 
program, there is a variable for each edge corresponding to the flow on that edge. The 
objective function, which is to be maximized, adds the flow on all outgoing edges of the 
source and subtracts the flow on all incoming edges of the source. There are constraints 
for each edge ensuring that the flow is non-negative and not larger than the capacity, and 
there is a constraint for each node except the source and the sink ensuring Kirchhoff’s law. 

The empty flow xq is a vertex of the polytope of feasible solutions. In particular, it is 
a feasible solution with minimum costs. Hence, letting u be the vector of edge costs is a 
valid choice in the shadow vertex method. For this choice every feasible flow / is projected 
to the pair (\f\,c(f)). Theorem [3] guarantees that the cost function depicted in Figure [ 2 ] 
forms the lower envelope of the polygon that results from projecting the set of feasible 
flows. There are two possibilities for the shadow vertex method for the first step: it can 
choose to follow either the upper or the lower envelope of this polygon. If it decides for 
the lower envelope, then it will encounter exactly the same sequence of flows as the SSP 
algorithm. 

This means that Theorem [l] can also be interpreted as a statement about the shadow 
vertex method applied to the maximum-flow linear program. It says that for this particular 
class of linear programs, the shadow vertex method has expected polynomial running time 
even if the linear program is chosen by an adversary. It suffices to perturb the costs, which 
determine the projection used in the shadow vertex method. Hence, if the projection is 
chosen at random, the shadow vertex method is a randomized simplex algorithm with 
polynomial expected running time for any flow linear program. 

In general, we believe that it is an interesting question to study whether the strong 
assumption in Spielman and Teng’s |23] and Vershynin’s |2BJ smoothed analysis that all 
coefficients in the constraints are perturbed is necessary. In particular, we find it an 
interesting open question to characterize for which class of linear programs it suffices to 
perturb only the coefficients in the objective function or just the projection in the shadow 
vertex method to obtain polynomial smoothed running time. 

Two of us have studied a related question [3]. We have proved that the shadow vertex 

method can be used to find short paths between given vertices of a polyhedron. Here, 

2 

short means that the path length is O(^p-), where n denotes the number of variables, m 
denotes the number of constraints, and 5 is a parameter that measures the flatness of the 
vertices of the polyhedron. This result is proven by a significant extension of the analysis 
presented in this article. 
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